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Abstract 

In this paper, an expression for the asymptotic growth rate of the number of small linear-weight 
codewords of irregular doubly-generalized LDPC (D-GLDPC) codes is derived. The expression is 
compact and generalizes existing results for LDPC and generalized LDPC (GLDPC) codes. Ensembles 
with check or variable node minimum distance greater than 2 are shown to be have good growth rate 
behavior, while for other ensembles a fundamental parameter is identified which discriminates between 
an asymptotically small and an asymptotically large expected number of small linear-weight codewords. 
Also, in the latter case it is shown that the growth rate depends only on the check and variable nodes 
with minimum distance 2. An important connection between this new result and the stability condition 
of D-GLDPC codes over the BEC is highlighted. Such a connection, previously observed for LDPC and 
GLDPC codes, is now extended to the case of D-GLDPC codes. Finally, it is shown that the analysis 
may be extended to include the growth rate of the stopping set size distribution of irregular D-GLDPC 
codes. 
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I. Introduction 

Recently, low-density parity-check (LDPC) codes have been intensively studied due to their 
near-Shannon-limit performance under iterative belief -propagation decoding. Binary regular LDPC 
codes were first proposed by Gallager in 1963 d. In the last decade the capability of irregular 
LDPC codes to outperform regular ones in the waterfall region of the performance curve and 
to asymptotically approach (or even achieve) the communication channel capacity has been 
recognized and deeply investigated (see for instance Q, 0, flU, Q, [0, 0). 

It is usual to represent an LDPC code as a bipartite graph, i.e., as a graph where the nodes are 
grouped into two disjoint sets, namely, the variable nodes (VNs) and the check nodes (CNs), such 
that each edge may only connect a VN to a CN. The bipartite graph is also known as a Tanner 
graph (HI. In the Tanner graph of an LDPC code, a generic degree-g VN can be interpreted 
as a length-g repetition code, as it repeats q times its single information bit towards the CNs. 
Similarly, a degree-s CN of an LDPC code can be interpreted as a length-s single parity-check 
(SPC) code, as it checks the parity of the s VNs connected to it. 

The growth rate of the weight distribution of Gallager's regular LDPC codes was investigated 
in ITJ. The analysis demonstrated that, provided that the smallest VN degree is at least 3, the 
ensemble has good growth rate behavior, i.e. a code randomly chosen from the ensemble contains 
an asymptotically small expected number of small linear-weight codewords. 

More recently, the study of the weight distribution of binary LDPC codes has been extended 
to irregular ensembles. Pioneering works in this area are flU, iflOl . ifTTI . In ifTTl a complete 
solution for the growth rate of the weight distribution of binary irregular LDPC codes was 
developed. One of the main results of IfTTI is a connection between the expected behavior of the 
weight distribution of a code randomly chosen from the ensemble and the parameter A'(0)//(1), 
X(x) and p(x) being the edge-perspective VN and CN degree distributions, respectively. More 
specifically, it was shown that for a code randomly chosen from the ensemble, one can expect 
an exponentially small number of small linear- weight codewords if < A'(0)p'(l) < 1, and an 
exponentially large number of small linear-weight codewords if A'(0)p'(l) > 1. 

This result establishes a connection between the statistical properties of the weight distribution 
of binary irregular LDPC codes and the stability condition of binary irregular LDPC codes over 
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Fig. 1. Structure of a D-GLDPC code. 



the binary erasure channel (BEC) 0, [|4||. If q* denotes the LDPC asymptotic iterative decoding 
threshold over the BEC, the stability condition states that we always have 

q* < [AW (IF 1 . (1) 

Prior to the rediscovery of LDPC codes, binary generalized LDPC (GLDPC) codes were 
introduced by Tanner in 1981 10. A GLDPC code generalizes the concept of an LDPC code 
in that a degree- s CN may in principle be any (s,h) linear block code, s being the code length 
and h the code dimension. Such a CN accounts for s — h linearly independent parity-check 
equations. A CN associated with a linear block code which is not a SPC code is said to be a 
generalized CN. In [8] regular GLDPC codes (also known as Tanner codes) were investigated, 
these being GLDPC codes where the VNs are all repetition codes of the same length and the 
CNs are all linear block codes of the same type. 

The growth rate of the weight distribution of binary GLDPC codes was investigated in |[T2ll . 
[fT3l . [fT4|. |[T5l . In |fl"2| the growth rate is calculated for Tanner codes with BCH check component 
codes and length-2 repetition VNs, leading to an asymptotic lower bound on the minimum 
distance. The same lower bound is developed in lfT3l assuming Hamming CNs and length-2 
repetition VNs. Both works extend the approach developed by Gallager in 03 Chapter 2] to 
show that these ensembles have good growth rate behavior. The growth rate of the number of 
small weight codewords for GLDPC codes with a uniform CN set (all CN of the same type) and 
an irregular VN set (repetition VNs with different lengths) is investigated in [14]. It is shown 
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that the ensemble has good growth rate behavior when either the uniform CN set is composed 
of linear block codes with minimum distance at least 3, or the minimum length of the repetition 
VNs is 3. On the other hand, if the minimum distance of the CNs and the minimum length of 
the repetition VNs are both equal to 2, the goodness or otherwise of the growth rate behavior of 
the ensemble depends on the sign of the first order coefficient in the growth rate Taylor series 
expansion. The results developed in 031 were further extended in lfT5l to GLDPC ensembles 
with an irregular CN set (CNs of different types). It was there proved that, provided that there 
exist CNs with minimum distance 2, a parameter A'(0)C, generalizing the parameter A'(0)p'(l) 
of LDPC code ensembles, plays in the context of the weight distribution of GLDPC codes the 
same role played by A'(0)p'(l) in the context of the weight distribution of LDPC codes. The 
parameter C is defined in Section |nij 

Interestingly, this latter results extends to binary GLDPC codes the same connection between 
the statistical properties of the weight distribution of irregular codes and the stability condition 
over the BEC. In fact, it was shown in lfT6l that the stability condition of binary irregular GLDPC 
codes over the BEC is given by 

q*< (A'(O)Cr 1 . (2) 

Generalized LDPC codes represent a promising solution for low-rate channel coding schemes, 
due to an overall rate loss introduced by the generalized CNs ifTTl . Doubly-generalized LDPC 
(D-GLDPC) codes generalize the concept of GLDPC codes while facilitating much greater design 
flexibility in terms of code rate lfT8l (an analogous idea may be found in the previous work |fl9lQ . 
In a D-GLDPC code, the VNs as well as the CNs may be of any generic linear block code types. 
A degree-g VN may in principle be any (q, k) linear block code, q being the code length and 
k the code dimension. Such a VN is associated with k D-GLDPC code bits. It interprets these 
bits as its local information bits and interfaces to the CN set through its q local code bits. A 
VN which corresponds to a linear block code which is not a repetition code is said to be a 
generalized VN. A D-GLDPC code is said to be regular if all of its VNs are of the same type 
and all of its CNs are of the same type and is said to be irregular otherwise^. The structure of 
a D-GLDPC code is depicted in Fig. [Q 

'Note that VNs associated with different representations of the same linear block code (i.e. with different generator matrices) 
are regarded as belonging to different types. 
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A related class of bipartite-graph codes was considered in [|20| where both CNs and VNs 
were generalized, but where the code bits were associated directly with the edges of the Tanner 
graph (and thus the generator matrices associated with VNs were irrelevant). In this work it was 
shown that in certain regular code ensembles with the same local code of minimum distance > 3 
at every CN and VN, asymptotically good codes exist in the ensemble which meet the Gilbert- 
Varshamov bound. These ensembles are generalizations of expander code ensembles [|2TT|. Also, 
[|22| presented similar results in the context of regular hypergraph codes with random coding 
at the nodes (for a fixed hypergraph), random hypergraphs (with the same local code at every 
CN and VN), and random selection of both hypergraph and local codes. Also, 11231 investigates 
the asymptotic weight enumerators of many LDPC-like codes including turbo codes and repeat- 
accumulate codes. 

In this paper the growth rate of the weight distribution of binary irregular D-GLDPC codes 
is analyzed for small weight codewords. It is shown that a given irregular D-GLDPC code 
ensemble has good growth rate behavior when there are no VNs with minimum distance 2, 
and likewise when there are no CNs with minimum distance 2. It is also shown that, in the 
case where there exist both VNs and CNs with minimum distance 2, a parameter 1/P _1 (1/C) 
discriminates between an asymptotically small and an asymptotically large expected number 
of small linear-weight codewords (the function P(x) is defined in Section [Til]). The parameter 
1/P~ 1 (1/C) generalizes the above mentioned parameters A'(0)p / (1) and A'(0)C to the case where 
both generalized VNs and generalized CNs are present. The obtained result also represents the 
extension to the D-GLDPC case of the previously recalled connection with the stability condition 
over the BEC. In fact, it was proved in [fT6l Theorem 2] that the stability condition of D-GLDPC 
codes over the BEC is given by 

q*<P-\l/C). (3) 

The paper is organized as follows. Section fll] defines the D-GLDPC ensemble of interest, 
and introduces some definitions and notation pertaining to this ensemble. Section |In] defines 
further terms regarding the VNs and CNs which compose the D-GLDPC codes in the ensemble. 
Section [IV] presents the main result of the paper regarding the growth rate of the weight 
distribution, together with several corollaries. Section |V] proves this main result, and Section IVTl 
concludes the paper. 
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II. Irregular Doubly-Generalized LDPC Code Ensemble 

We define a D-GLDPC code ensemble M. n as follows, where n denotes the number of VNs. 
There are n c different CN types t E I c = {1, 2, • • • , n c }, and n v different VN types t E I v = 
{1, 2, ■ ■ • ,n v }. For each CN type t E I c , we denote by h t , s t and r t the CN dimension, length 
and minimum distance, respectively. For each VN type t E I v , we denote by k t , qt and p t the 
VN dimension, length and minimum distance, respectively. For t E I c , p t denotes the fraction 
of edges connected to CNs of type t. Similarly, for t E I v , Xt denotes the fraction of edges 
connected to VNs of type t. Note that all of these variables are independent of n. 

The polynomials p(x) and X(x) are defined by 

and 

\(x) = X t x qt ~ l . 
tei v 

If E denotes the number of edges in the Tanner graph, the number of CNs of type t E I c is then 
given by Ep t /s t , and the number of VNs of type t E I v is then given by EX t /q t . Denoting as 
usual Jq p(x) dx and J 1 X(x) dx by J p and J X respectively, we see that the number of edges 
in the Tanner graph is given by 

and the number of CNs is given by m = E J p. Therefore, the fraction of CNs of type t E I c is 
given by 

it = ^r (4) 

and the fraction of VNs of type t E I v is given by 

St = ~~rr ■ (5) 
VtjX 

Also the length of any D-GLDPC codeword in the ensemble is given by 

Note that this is a linear function of n. Similarly, the total number of parity-check equations for 
any D-GLDPC code in the ensemble is given by 

m \ ^ Pt(st — h t ) 
M = -=- 



tel 
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A code in the irregular D-GLDPC ensemble then corresponds to a permutation on the E edges 
connecting CNs to VNs. The design rate of the D-GLDPC ensemble is given by 

M_ E K ,,P»(i-fi.) 

where for t E I c (resp. t G /„), i?t is the local code rate of CNs (resp. VNs) of type t. Each 
code in the ensemble has a code rate larger than or equal to R. 

The growth rate of the weight distribution of the irregular D-GLDPC ensemble sequence 
{A4 n } is defined by 

G(a) = lim -logE Mn [N an ] (8) 

n— >oo TL 

where E, Mn denotes the expectation operator over the ensemble Ai n , and N w denotes the number 
of codewords of weight w of a randomly chosen D-GLDPC code in the ensemble M. n . The limit 
in ® assumes the inclusion of only those positive integers n for which an E TL and E^ n [N an ] is 
positive (i.e., where the expression whose limit we seek is well defined). Note that the argument 
of the growth rate function G(a) is equal to the ratio of D-GLDPC codeword length to the 
number of VNs; by ©, this captures the behaviour of codewords linear in the block length, as 
in HB for the LDPC case. 

Definition 2.1: Let G(a) be the growth rate of the weight distribution of an irregular D- 
GLDPC ensemble sequence. The critical exponent codeword weight ratio is defined as a* = 
inf{a > | G(a) > 0}. Also, the ensemble sequence is said to have good growth rate behavior 
if a* > 0, and bad growth rate behavior if a* = 0. 

Thus an irregular D-GLDPC code ensemble sequence has good growth rate behavior if and only 
if it contains an asymptotically small expected number of small linear-weight codewords. Note 
that an ensemble with good growth rate behavior must necessarily contain asymptotically good 
code sequences. The present definition of the critical exponent codeword weight ratio may also 
be found in [|27l . 

We next define the concepts of assignment and split assignment. The concept of assignment 
was used in ifTTTl to develop an expression for the growth rate of the weight distribution of 
irregular LDPC code ensembles. The concept of split assignment is introduced in this paper. 

Definition 2.2: An assignment is a subset of the edges of the Tanner graph. An assignment 
is said to have weight k if it has k elements. An assignment is said to be check-valid if the 



following condition holds: supposing that each edge of the assignment carries a 1 and each of 
the other edges carries a 0, each CN recognizes a valid local codeword. 

Definition 2.3: A split assignment is an assignment, together with a subset of the D-GLDPC 
code bits (called a codeword assignment). A split assignment is said to have split weight (u, v) if 
its assignment has weight v and its codeword assignment has u elements. A split assignment is 
said to be check-valid if its assignment is check-valid. A split assignment is said to be variable- 
valid if the following condition holds: supposing that each edge of its assignment carries a 1 and 
each of the other edges carries a 0, and supposing that each D-GLDPC code bit in the codeword 
assigment is set to 1 and each of the other code bits is set to 0, each VN recognizes a local 
input word and the corresponding valid local codeword. 

Note that for any D-GLDPC code, there is a bijective correspondence between the set of D- 
GLDPC codewords and the set of split assignments which are both variable-valid and check-valid. 

III. Further Definitions and Notation 
The weight enumerating polynomial for CN type t E I c is given by 

u=0 



1 + £ Agx* 



u=r t 



Here Au > denotes the number of weight-w codewords for CNs of type t. Note that A$ > 
for all t E I c . Also, for each t E I c , corresponding to the polynomial A^\x) we denote the sets 

U t = {i E PJ : Af > 0} (9) 

and 

U t - = U t \{0} . (10) 
The bivariate weight enumerating polynomial for VN type t E I v is given by 

B®(x,y) = EE^V 

u=0 v=0 

k t qt 

«=1 v=pt 
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Here Bu\ > denotes the number of weight-f codewords generated by input words of weight u, 
for VNs of type t. Also, for each t G I v , corresponding to the polynomial B®(x, y) we denote 
the sets 

S t = {(i,j)etf l : Bg>0} (11) 

and 

Sr = S t \{(0,0)} . (12) 

We also define 

S- = iWT • (13) 

We denote the smallest minimum distance over all CN types by 

r = min{r t : t G J c } > 2 
and the set of CN types with this minimum distance by 

X c = {tel c : r t = r] . 

We define the parameter 

^ = r/(r-l) (14) 
and note that we have 1 < < 2 with equality if and only if r = 2. We define the parameter 

c = r y ^->o. as) 

z — ' St 

We also define f as the smallest integer i > r such that there exists some CN with a non-zero 
number of weight-z codewords: 

f = min{z > r : > for some t & I c and i G f/ t - } . (16) 

The parameter r represents the second smallest minimum distance over all CN types. 
Similarly, we denote the smallest minimum distance over all VN types by 

p = min{p t : t G I v } > 2 

and the set of VN types with this minimum distance by 

X v = {t G I v : Pt=p] ■ 
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We also define p as the smallest integer j > p such that there exists some VN with a non-zero 
number of weight- j codewords: 

p = min{j > p : B^j > for some t E I v and (z, j) G S^"} . (17) 

The parameter p represents the second smallest minimum distance over all VN types. 
For each E S~, define 



and define the parameter 



and the set 



Tij = 3 —^- , (18) 



T= min {Ttj} (19) 



F„ = < t E I v : min {T id } = T 



We also define the parameter 



X= min {(T M -T)z} . (20) 

(i,j)&S-:T^>T 



Since 1 < ip < 2 with equality if and only if r = 2, and j > p > 2 for all E S , it follows 
that T > with equality if and only if r = p = 2. Also, for t E Y v , define 

P t =\(t,j)ESi : 3 —^ = t\ . (21) 



i 

Note that in the specific case r = p = 2, we have T = and Y„ = X v , and we may write 
P t = {(i, 2) : «G L t } where L t = {z G N : > 0} for each t E X v - note that these sets 
are nonempty. 

We define the polynomials 

<w*) = £7 E "(-)" ■'■' ^ 

teY v qt {i tj )eP t V 7 

and 

w=E7 e ttgw^rv (23) 

Since all of the coefficients of Qi(x) and Q2(x) are positive, these polynomials are both 
monotonically increasing on [0, oo) and therefore their inverses, denoted by Q^ 1 (x) and Q^ix) 
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respectively, are well-defined and unique on this interval. Note that in the case r = p = 2, we 
have 

Q l {x) = C-P{x) 



where 




(24) 



Also note that in the case r = p = 2, (fT5l) becomes 




(25) 



t&x c 



and we define 



V = 2 y > (26) 



as the counterpart of the parameter C in the variable node domain. Here = Y^i&L t -^2 ^ s 



the total number of weight-2 codewords for VNs of type t. Note that in this case the parameter 
C depends only on the CNs with minimum distance 2, and the parameter V and the polynomial 
P(x) depend only on the VNs with minimum distance 2. Also note that while the polynomial 
P(x) given by (124)) depends on the VN representations (i.e. generator matrices), the parameter 
V given by (|26|) does not. 

Throughout this paper, we make use of the following standard notation. Let g(x) be a 
nonnegative real- valued function, and let f(x) be a real- valued function. We say that f(x) 
is O (g(x)), writing f(x) ~ O (g(x)), if and only if there exist positive real numbers k and e, 
both independent of x, such that 



Let a{n) and b(n) be two real-valued sequences, where b(n) ^ for all n, and let q(n) = 
a{n)/b{n). We say that a{n) is asymptotically equal to b{n) as n — > oo, writing a{n) w b(n), if 
and only if lim^oo q(n) = 1. 

Finally, throughout this paper, the notation e = exp(l) denotes Napier's number. 



f(x)\<kg(x) V0<x<e. 
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IV. Growth Rate for Doubly-Generalized LDPC Code Ensemble 



The following theorem constitutes the main result of the paper. 

Theorem 4.1: Consider an irregular D-GLDPC code ensemble sequence M. n . The growth 
rate of the weight distribution is given by 



G(a) = — a log a + a 



log 



-log 



i 



+ 0(a $ ) 



where 



r yj 



(27) 



(28) 



This theorem is proved in Section |V] We next provide a series of corollaries to this result; this 
serves to illustrate the manner in which several related results in the literature follow as special 
cases of Theorem 14.11 

Corollary 4.2: In the case where either r > 2 or p > 2, the growth rate of the weight 
distribution is given by 



T 

G(a) — — a log a + 0(a) 

V 



(29) 



where T > 0. 



Thus if either r > 2 or p > 2, we have a* > and the ensemble sequence exhibits good 
growth rate behavior. This generalizes results along this line in [fl4ll . |fl"5ll , GDI . A special case 
of Corollary 14.21 is as follows. 

Corollary 4.3: Suppose r > 2 or p > 2 and also U teYv Pt — {(hj)} f° r a single point 
i.e. a single point achieves the minimum in < fT9l although this may be manifest in 
different VN types t e Y v . Then 



T 



G(a) = — a log a + Ka + 0(a^) 
V 



(30) 



where K is given by 



K= l - 

i 



log I i^B^jdt 



t&Y v 



-logC7+ - log 

r ip \ i 



T 

1 



(31) 
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Proof: This result follows by making the appropriate substitutions in Theorem 14.11 and 
noting that in this case Qi(x) and Q 2 (x) are monomials satisfying Q2{Qi 1 {^)) = i/j- ■ 

Corollary 4.4: Consider a GLDPC code ensemble with irregular CN set and irregular VN 
set (i.e. different VN degrees). Let r denote the smallest minimum distance of the CNs, and p 
denote the minimum VN degree. Then 

G(a) = (p - £ - l) a log a + Ka + O(a^) , (32) 



| + - log C + ^ log I 



where 

= lot? I pi I + 

r ^ y e 

where 5 represents the fraction of VNs of degree p. 

Proof: In this case, each VN type t E I v satisfies B^(x, y) = l + xy Pt . Let t G I v represent 
the VN type with minimum length (degree), i.e., pi = p, and note that 5 = 5f, then Y v = {t} and 
U t€ y v Pt = Pi — {(1;P)}- Application of Corollary 14.31 then directly yields the required result, 
where we use the fact that T = p — tp in this case. ■ 

This provides a generalization of the result of lfT4l which derived d32l) for the case of GLDPC 
codes with regular CN sets and irregular VN degrees, and which did not include the result (1331) 
regarding the evaluation of the parameter K. 

Corollary 4.5: Consider a D-GLPDC code ensemble Ai n satisfying r = p = 2. Then the 
growth rate of the weight distribution is given by 

1 



G(a) = a log 



+ 0(a § ), (34) 



_P- 1 (1/C). 

where the polynomial P(x) and the parameter C are given by (|24|) and (1251) respectively, and 
where 

c ■ \ f P \ J 2 if r > 3 orp> 3 

^ = min h'9' 2 r , (35) 

12 2 } 3/2 otherwise. 



Proof: When r = p = 2, we have T = and tp = 2; also by (1201) we have 

X = min {iTi ,} = min {j — ^} = p — 2 , 

(i,j)es-.T itj >o ,J (i,j)es-.j>2 
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which implies that xli* + 1 = p/2. Also, it may be verified that Q 1 = P 1 (1/C) in this 
case. ■ 

Corollary 14.51 first appeared in E4l . A necessary and sufficient condition for a D-GLDPC 
ensemble satisfying r = p = 2 to have good growth rate behavior follows in a straightforward 
manner as shown next. 

Corollary 4.6: Consider a D-GLPDC code ensemble A4 n satisfying r = p = 2. Then, a 
necessary and sufficient condition for Ai n to have good growth rate behavior is 

C ■ V < 1 (36) 

where C and V are given by (|25l) and (|26l) respectively. 

Proof: From (|34|) the necessary and sufficient condition is 1/P _1 (1/C) < 1; rearranging 
and using the monotonicity of P(x) yields the result. ■ 

It is worth noting that an analogous theorem to Theorem 14.11 holds also for the stopping set size 
distribution of irregular D-GLDPC codes. The proof is almost identical to that of Theorem 14.11 
and is outlined in Appendix [Bj 



A. Discussion 

From Theorem 14. II and from the definitions of Q\(x) and Q2(x) given in ([22]) and (1231) respec- 
tively, we observe that the triples (t, i,j) (or, equivalently, VN input-output weight enumerating 
function coefficients E>fj) such that (i, j) lies in one of the sets P t (t G Y v ) make a dominating 
contribution to the growth rate for values of a close to zero. We will refer to the set of such 
triples as the dominant set. Note that the dominant set may equivalently be described as the set 
of triples j) such that = T for some t G I v , (i, j) G . 

Interestingly, the dominant set admits an instructive graphical interpretation and may be easily 
identified using a very simple geometric construction. In the plane, a line C through the 
fixed point (0, ip) is rotated in an anticlockwise fashion until it comes in contact with one or 
more of the points (i, j) G S~. The slope of the line C at this point is the parameter T defined 
in (fl9l ), the set of t G I v which have points (i, j) G C is the set defined as Y v , and for each such 
t the set of such points on C is the set P t defined in (|2Tj) . Note that due to CHK the position 
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Fig. 2. Diagram of the VN input-output weight enumerating functions and growth rate dominant set in the plane. The 
illustration is for a smallest CN minimum distance of r — 3, so ip = 3/2. The sets for the two VN types t 6 I v = {1, 2} 
are illustrated by open circles and filled diamonds respectively. The line C has a fixed point at (0, ip) and is rotated in an 
anticlockwise fashion until it touches any of these points. In this example, this occurs at the point (3,3) £ S± . Therefore in 
this example T = 1/2, Y v = {1}, Pi = {(3,3)} and for small values of a the dominating contribution to the growth rate 
comes from weight-3 local codewords of the type-1 VNs generated by weight-3 local input words. 



of the fixed point (0,^) depends only on the smallest CN minimum distance r, and always lies 
somewhere on the line segment joining (0, 1) and (0, 2), including the latter endpoint. 

This interpretation is illustrated in Fig. [2] for an example D-GLDPC code with two VN types 
I v = {1,2}. Type 1 G I v (open circles) corresponds to the Hamming code of length q\ = 7, 
dimension k\ = 4, minimum distance pi — 3 and represented by the systematic generator matrix 
G H = [1000111,0100110,0010101,0001011]. Type 2 e I v (filled diamonds) corresponds to the 
first-order Reed-Muller code of length q 2 = 8, dimension fc 2 = 4, minimum distance p 2 = 4 and 
represented by the generator matrix G RM = [11111111, 11110000, 11001100, 10101010]. In the 
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specific case of Fig. [2] we have ip = 3/2, which corresponds to r = 3o It may be deduced from 
the figure that for values of a close to zero the growth rate G(a) is dominated by weight-3 local 
codewords of the Hamming VNs associated with weight-3 local input words. 

From Corollary 14.51 we observe that in the special case where r = p = 2, the growth rate 
depends only on the CNs and VNs with minimum distance equal to 2, and ( 1341 ) is a first-order 
Taylor series around a = which directly generalizes the results of [fTT]] and lfl5l (for irregular 
LDPC and GLDPC codes respectively) to the case of irregular D-GLDPC codes. Note that the 
error term in (|34|) is O(ct^), which is different to the error term 0(a 2 ) reported in previous 
literature (Theorem 4.1 in j[24|. Theorem 5 in 0T]); this difference is manifest in the case where 
r = p = 2 and either f = 3 or p = 3. Corollary 14 .5 1 indicates that in the analysis of the asymptotic 
growth rate of the weight distribution, the parameter 1/P _1 (1/C) in the context of D-GLDPC 
codes plays an analagous role to the parameter A'(0)p'(l) for irregular LDPC codes, and to 
the parameter A'(0)C for irregular GLDPC codes. It discriminates between ensemble sequences 
with good growth rate behavior, for which 1/P _1 (1/C) < 1, and ensemble sequences with bad 
growth rate behavior, for which 1/P _1 (1/C) > 1. 

V. Proof of the Main Result 

In this section, Theorem 14.11 is proved. For ease of presentation, the proof is broken into four 
parts. 



A. Number of check-valid assignments of weight em over 7m CNs of type t G I c 

Consider 7m CNs of the same type t E I c . Using generating functions, the number of check- 
valid assignments (over these CNs) of weight em is given bye 



Coeff 



(AV{x)) 



2 For example, assuming a uniform CN set composed of (15, 11) Hamming codes, and assuming Ai = 7/15 and A2 = 8/15, 
through Q this would represent a rate R = 1/2 ensemble. 

3 Here we make use of the following general result 1251 . Let a; be the number of ways of obtaining an outcome i £ Z in 
experiment A, and let bj be the number of ways of obtaining an outcome j 6 Z in experiment B. Also let be the number 
of ways of obtaining an outcome in the combined experiment (A, B) with sum i + j — k. Then the generating functions 
M x ) = J2i a i xi > B{x) = J2i bjX j and C{x) = J2k c ^ xk are related by C{x) - A{x)B{x). 
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where Coeff [p(x),x c ] denotes the coefficient of x c in the polynomial p(x). We now use the 
following result, the proof of which appears in ifTOl Appendix A]: 

Lemma 5.1: Let A{x) = 1 + J2t= c AuX u , where 1 < c < d, be a polynomial satisfying 
A c > and A u > for all c < u < d. For a fixed positive rational number £, consider the set of 
positive integers I such that (leN and Coeff [(A(x)) 1 ,x^] > 0. Then either this set is empty, 
or it has infinite cardinality; if t is one such £, then so is jt for every positive integer j. In the 
latter case, the following limit is well defined and exists: 

lim ~ log Coeff (A(x)Y , x il = max ^ ft log (^j (37) 

where [/ = {i E N : > 0}, /3 = (ft)i e t/> an d me maximization is subject to the constraints 

E ief / A = 1, E i6C / = £ and ft > for all i E U. 

Applying this lemma by substituting A(x) = A®(x), i = 7m and £ = e/7, we obtain that with 
7 fixed, as m —7- 00 

N ( J l \em) = Coeff f(i w (i)) 7m , x em (38) 



/// 




exp < 7717 max V" fif ) log -j-r } (39) 
= exp{mW t V7; (e)j 

where the maximization over (3® = (ft- )ieu t is subject to the constraints ^2 ieU ft- = 1, 
Siet/- Z 'A = e /l an ^ ft-*'' > for all i E U t (recall that the sets U t and are given by © 
and CUB). 



5. Number of check-valid assignments of weight 5m 

Next we derive an expression, valid asymptotically, for the number of check- valid assignments 
of weight 5m. For each t E I c , let e t m denote the portion of the total weight 5m apportioned to 
CNs of type t. Then e t > for each t E I c , and Yltei e * = ^- Also denote e = (ei e 2 ■ ■ • e n J. 
The number of check-valid assignments of weight 5m satisfying the constraint e is obtained by 
multiplying the numbers of check-valid assignments of weight e t m over 7 t m CNs of type t, for 
each t E I c , 

NP (5m) = J] N% m) (e t m) (41) 
17 



where the fraction 7 t of CNs of type t E I c is given by ©. 

The number of check-valid assignments of weight 5m, which we denote N c (Sm), is equal to 
the sum of N { c e) (5m) over all admissible vectors e; therefore, as m — > oo 

N c (5m) S explmJ2W t M (e t ) I (42) 

where we have used (|40l) and (|4T1) . However, the asymptotic expression as m — > oo is dominated 
by the distribution e which maximizes the argument of the exponential. Therefore as m — > oo 

N c (5m) S exp {mf} (43) 

where 



W = max •£ It m« £ ft«> log -L , (44) 
teic p ieu t \Pi / 

the maximization over e is subject to the constraint 

J> = 5, (45) 

teic 

and for each t 6 J c the maximization over = (/^ )i e t/ t is subject to the constraints 

XX> = 1 (46) 



and 



Next, for each t e J c we define 



iec/; 



/# > Vi £ Z7 t . (48) 



W (t) )=^f-^)-EA w 



We then have the following lemma. 

Lemma 5.2: The expression ^ te j 7t-Fi(/3 ) is 0(5 2 ) for any satisfying the optimiza- 
tion constraints (l45T)-(l48T). 

4 Observe that asm-) oo, exp(mZt) « exp(mmaxt{Zt}) 
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A proof of this lemma is given in Appendix lAl It follows from Lemma 15^21 that the expression 
Eier.Tt-W^) is °0 2 ) for the maximizing (3 {t) . Therefore 



W = max > 7+ max 



E^M^l+W": 



2\ 



= max ^ 7t max log +0(5 

«e/ c ^ ie[ 7- V Pi / 

where the maximization over (3^ = {fif ) ieU - (for each t G J c ) is subject to the constraint ( |47T ) 
together with > for all i G £/ t ~. In what follows, for convenience of presentation we shall 
temporarily omit the 0(5 2 ) term in the expression for W. 

Next we make the substitution 9® = ^tPi f° r all t G I c , i G This yields 

W = max V max V # log [ 

tela ° ie u- V °i 

where the maximization over 0® = {9f ) ieU - (for each t G I c ) is subject to the constraints 
^2 ieU - iOf = e t and 9f' > for all i G Uf . We observe that this maximization may be recast 
as 



W = maxj^ flflog 

tele i£U7 







it) 



where by (|45T) the maximization, which is now over = {9\ ) t ei c ,ieu~' ^ s su bject to the 
constraints 

E E «f = * 

and ef ] > for all t G J c , i G C/ t ". 

Making the substitution i> s . = 0^/5 for all £ G J c , z G Uf, we obtain 

w = 6 m ,f x E E ^ lo § ( ) (49) 



t£lc i£U. 



where the maximization over v = (vf ) teIc ieU - is subject to the constraints 

E E ^ = 1 (so) 



and vf ] > for all t G I c , i G Iff. 
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Solving the constrained optimization ( |49l ) using Lagrange multipliers yields 

v ® = -LJL e -* \/tel c ,ieUf , (51) 

where A is the Lagrange multiplier. Substituting (l5TT) into (l50l) and defining z = e~ A yields 

X)X)<^*V = *- (52) 

We may write this as 

J]r4S* r + E E *4*V = * (53) 
tGX c te/ c ieuf\{r} 

from which we obtain (since all coefficients are positive and z = e~~ x > 0) 

t€X c 

and therefore 

2 < ! ttt; * 1/r (54) 

(E««.^?'7.) 

is valid for all 5 > 0. Thus 

z ~ 0(5 1/r ) . (55) 
Recalling the definition z = e~ x , (ISTI) and (1551) together imply that 

~ O (5 < / r - 1 ) Vt 6/ c ,i6 Z7 t " . (56) 

Next, since the value of u which achieves the maximum in ( |49l ) satisfies (ISTI) . we may develop 
(EH) as 

W = J2 v i t} ( 1 + iX ) (5 7 ) 

*e4 ieur 



where in the second line we have used the constraint ([50] 
Now, the constraint (l50l) may be written as 



i = E E ™. w + E 
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so 



W 1 1 / • \ (t) 



EE 



(59) 



Also, we may write by (pi 



Multiplying by r and summing over all t E X c yields 



6e rX rv? = rA rht ■ 



Extracting A yields 

A 



tex c 



r 8 I ^ „,(*) 



tex c 



i log (£^)_I log (g ro ,.) 

)-i'og(l- £ 

/ \ t,i : i>r 



r l0g 5 



(60) 



where we have used (|5Q|) in the final line. Substituting (l59l) and (l60l) back into (1581) yields 



;)'°4 eE ' 6 1 M '" 7 ' 



E (*- r H + 

i,i : i>r 



9 log {ifr 



.(*) 



(i - r 



log f i - E ^1 
^ g (i - E 

\ t,i : i>r / 



where in the second line we have used ( TT3T ), and in the final line we have used ( |56l ) and the fact 
that log(l + a) ~ 0(a) (also recall the definition ([TBI)). 

Substituting this expression for W into (1431) while recalling the 0(5 2 ) term in the expression 
for W, we have that as m — > oo 



N c (Sm) ~ exp < m 



^l0g +0 ( 5 n«n{f/r,2}^ 

r \5JpJ 

Note that (|6TT) generalizes [11, eqn. (30)] to the case of a generalized CN set. 



(61) 
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C. Number of variable-valid split assignments of split weight (rn, an) over VNs of type 

t e l v 



Consider VNs of the same type t G I v . We now evaluate the number of variable- valid 
split assignments (over these VNs) of split weight (rn, an). Using generating functions, this is 
given b>|f 



N^J ] {rn, an) = Coeff (B {t) {x, y))^ , x Tn y° 

where Coeff \p(x, y), x c y d ] denotes the coefficient of x c y d in the bivariate polynomial p(x,y). 
We make use of the following result, the proof of which appears in ffTOl Appendix A]: 

Lemma 5.3: Let 

k d 

B(x, y ) = i + j2J2 B ^ xU y v 

u=l v=c 

where k > 1 and 1 < c < d, be a bivariate polynomial satisfying B u v > for all 1 < u < k, 
c < v < d. For fixed positive rational numbers £ and 9, consider the set of positive integers t 
such that it G N, 6t G N and Coeff [(B(x, y)) e , x il y et ] > 0. Then either this set is empty, or 
has infinite cardinality; if t is one such i, then so is jt for every positive integer j. Assuming 
the latter case, the following limit is well defined and exists: 



lim - log Coeff 

^->oo r 



(B(x, y)) e , x^y m ) = max £ VlJ log (^A (62) 

" d,j)es V ' / '-' J 

where S = {(i,j) G N 2 : B it j > 0}, 77 = (Vi,j)(i,j)€S> an d the maximization is subject to the 
constraints E (i)i ) e 5^,i = L E(i,i) e s%J = E (i ,j) e si^,j = ® and raj > for all (ij) G S. 



5 We use the following result on bivariate generating functions 1251 . Let atj be the number of ways of obtaining an outcome 
G in experiment A, and let b^j be the number of ways of obtaining an outcome (k, I) € Z 2 in experiment B. Also let 
c P]Q be the number of ways of obtaining an outcome (k, I)) in the combined experiment (.4, B) with sums i + k — p 

and j + l = q. Then the generating functions A(x,y) = J^t.j ^ijx l y\ B(x,y) = J^k.i b k,ix k y l and C(x, y) = J2 p , q c v , q x v y q 
are related by C(x,y) = A(x,y)B(x,y). 
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Applying this lemma by substituting B(x,y) = B^(x,y), £ = 777,, £ = 7-/7 and 6 = a/7, we 
obtain that with 7 fixed, as n — > 00 

N$f{Tn, an) = Coeff [(B (t) (x, y)) 7 " , x rn y an j (63) 

(64) 



« exp J n7 max V tM log ( -g- J 

±exp{nX™(T,<j)} 



(65) 



where the maximization over rj® = {Vij)(i,j)es t ls subject to the constraints Y^(ij)es t ^ij = ■"•> 



,(*) 



(0 



E(ij) e s- = r /7, E(ij)eflT^S = °"/ 7 and ^3 - for a11 ^ G St (reCa11 that the SCtS 
S t and S7 are given by <[TT1> and (fT2l)). 



,(*) 



,(*) 



D. Growth rate of the weight distribution of the irregular D-GLDPC code ensemble sequence 

Recall that the number of check-valid assignments of weight 5m is N c (5m); also, the total 
number of assignments of weight 5m is L E ) . Therefore, the probability that a randomly chosen 
assignment of weight 5m is check-valid is given by 

P*m(5m) = N C (5m)/ ( 

Here we adopt the notation 5m = fin; also we have E = mj J p = nj J X. The binomial 
coefficient may be asymptotically approximated using the fact, based on Stirling's approximation, 
that as n -»■ 00 ifTTll 



rn\ n 



an 



exp < n 



(^) +0(<7 2 



(valid for < a < r < 1) which yields, in this case, 



n/jX 

(3n 



exp < n 



/Slog 



(3fX 



0{f 



as n 00. Applying this together with the asymptotic expression (16Tb . we find that as n — > 00 
(exploiting the fact that 5 J p = /3 J X) 



Pvaiid(/?n) « exp{nF(/3)} 



where 



/Slog 



/9/A 



+ O (/3 min{f/r ' 2} ) . 



(66) 



(67) 
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Next, we note that the expected number of D-GLDPC codewords of weight an in the ensemble 
A4 n is equal to the sum over (3 of the expected number of split assignments of split weight 



(an, i3n) which are both check-valid and variable-valid, denoted N 



v,c 

an, fin' 



[Nan]=J2 E M n [K^ n }. 



This may then be expressed as 



where the fraction 5 t of VNs of type t £ I v is given by © and the second sum is over all 
partitions of a and f3 into n v elements, i.e., we have a t , f3 t > for all t £ I v , and Yltei a t = a ' 

E*/. A = p. 

Now, using (I63l)-(l65l). as n — > oo we have for each t £ I v 



N$ n \a t n, i3 t n) ~ exp <j nX t w K i3 t ) 



where, for each t £ I v , 



Xi st \a t ,p t ) = 8 t max £ nf] log 



5 



(*)• 
' 1 



,(*) 



{i,j)es t \ %j 

and the maximization over tj® = (Vij) (i,j)es t is subject to the constraints 



E < 



(t) 



and 



E Wi] = atl5t 

(i,i)65 t - 

/?S>0 V(i,j)G5i 



Therefore, recalling (|66l , we have that as n — > oo, 



Em„ [JV^J w £ £ exp 



E/9t=/3 



(68) 



(69) 
(70) 
(71) 

(72) 
(73) 
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Next, for each t G I v we define 



Vfo.O/ (i,i)65 t - 



Note that the expression ( 1731 ) is dominated as n — >• oo by the term which maximizes the argument 
of the exponential. Thus using (l67l) and (1681) we may write 



G(a) 



max max 

Eft 



< y^<5 t ma 

a ^ teiV, 



max 

) 



^S lo s 

(i,i)e5 t - 



(*)' 



(*) 



+ F t (r i 



r 



{m)- p]ot (m) +0 ^' l ' M) } (74) 



where the maximization over ry w = (vf]) (%,j)es~ (f° r eacn * e * s subject to constraints (|70b 
and d2B together with r/fj > for all (z, j) G S'f. 
We next have the following lemma. 

Lemma 5.4: The expression Yltei ^tFtiv ) 1S 0(a 2 ) for any rj® satisfying the optimiza- 
tion constraints Jg9b- (|72b . 

The proof of this lemma follows the same lines as the proof of Lemma 15.21 and is therefore 
omitted. It follows from Lemma 15.41 that the expression ^2 teI S t F t (rj^') is 0(a 2 ) for the 
maximizing rj^\ Also, since /3/a is bounded between two positive constants, any expression 
which is 0(/3 K ) must necessarily also be 0(a K ) (where k > 0). Therefore 



G(a) = max max 
E«t= a 

Y.Pt=/3 



J2$t max ^S lo g(^f 



r 



eC 



-/31og 



/3/A 



+ O (Q; min { f / r ' 2 >) 



where the optimization is (as before) subject to the constraints (1701 ) and ( 1711 ) together with 77^] > 



for all (i, j) G S7- In what follows, for convenience of presentation we shall temporarily omit 
the O (a mm { r / r ' 2 }) term in the expression for the growth rate. 

Next we make the substitution 7^ • = 5 t r]f^ for all t G I v , G S^. This yields 



G(a) 



max max 

Eft=/3 



^max £ ^S lo S 



(<J)eS t - 



(*) 



/3 



log 



eC 



/Slog 



/3/A 
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where the maximization over 7^ = (lij)(ij) e s~ (f° r eacn * e ^v) is subject to the constraints 
maximization may be recast as 



E(ij)es- *tS = a t, J2(i,j)es- nlj = fit, and 7 g > for all G S t . We observe that this 



G(a) = max 



E E 



»>3 



/3(7) 



(*) 

7j 



log 



r ta V/3(7)/A 



■^(7) log 



/3(7)/A, 



where the maximization, which is now over 7 = (lij)tei v ,{i,j)eS^' * s subject to the constraints 



(t) 



£ £ = 

te/„ (»j)e5 t - 



and 7- j > for all t G I v , (i, j) G S t , and where 



(t) 



0(7) = £ £ ^ 



Making the substitution = jfj/a for all t G J^, (z, j) G 5 t , we obtain 



G(a;) = a max 



E E -S'oi 



(*) 



+ ^log 



lo£ 



az 



where the maximization over 1/ = {^f]) teIv (ij)es~ * s su bj ect t0 me constraint 



E E -S = i 



as 



well as > for all t G I v , (i, j) G S t , and where 



2 1/ 



E E i 



v 



(t) 



By solving (1731) directly using Lagrange multipliers, one shows that 

1 



log 



h3 



av. 



(t) 
1 3 



J lQg f g 



jlog 



az 



Ai 



(75) 



(76) 



(77) 



(78) 
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holds for all t E I v , E S t ; here A denotes the Lagrange multiplier. Substituting (1781) back 
into ([75]) yields 

C \ ( 1 



G(a) 



a 



E E 



v 



(*) 



+ 



eC 

log 



r 



az 



1 + Az - - lof 



zff) lot 



jlog 



az 



(i/) /A 



0-2(1/)/ A 



a 



a 



E E ^+a + ^-,m 



1 



A -sE E 

where in the second line we have used (l76l) and (1771) . Note that (1781) may be rearranged as 



(79) 



av% = B^C* " ( az(i/) / A 



(80) 



where u = e A is a positive real number. Substituting this solution into (1771) (i.e. into the 
definition of z(u)) and using © yields 



E7 E iB®C» (azW) f 
and similarly the constraint ( 1761 ) may be written as 



Co' 



(81) 



j-i> 



e^e^H/a)^-^ 

(0 



(82) 



We next proceed by proving an upper bound on all terms v\ - such that (t,i,j) does not lie 



in the dominant set discussed in Section HV-A[ i.e., on all terms vj 9 where £ G /„, G S7 
and Ty > T. To this end, fix s E I v , (k, I) E S~ with T k j = T, and consider arbitrary t E I v , 
E (for which we have > T). Then applying (1781) in the two pertinent cases yields 



A = - log 

1 



(*) 



<xz(f ) / A 
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and 



A = -lo, 



av 



k.l 



az{is) l A 



Equating these two expressions for A we obtain 



- log vf] - — log I 

i ' J k 

where we define the function 



0) 



1> 



log a + F«j (a) -Fff (a) 



(«), 



F$(a) = -log 



B f) 5 C 3/r ( z{u) / A 



1,3 



3 /*l>' 



(83) 



for every t G I v , G S t (it is easy to check that this is indeed a function of a). Note that 
z{y) is bounded above and below as a — > 0; this may be easily shown since (1761) and (1771) imply 

< ( min % ) V V z/f] < 1< ( max i ] V V i/$ 



t&v (i,j)eS, 



and 



<( { ™^')E E €<^)<( 



max 7 



teJ, (i,i)e5 t - 

E E 

(i,j)es; 



(*) 



respectively. Thus it follows that for any t G I v , G Sj~, the function F>j(a) given by ([831 
is bounded above and below as a — > 0. Therefore 



M ■ 



h3_ 

(»), 



(84) 



where r£j(a) = exp(F-^(a)) > for every t G /„, (z',j) G S t . We next write ([841) as 

^ = (^r /fc A5^(a)a^ 
where for conciseness we have defined 



(85) 



rg(«). 



> o 



Substituting into (1761) we obtain 



(i,j)es- 



a ) a * 



which in particular implies (since all terms involved are nonnegative, and T^j = T for all t G Y v , 

(hj)eP t ) 

E E 



^?r /fc A£i(«) < i 



ten (ij)ePt 
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If we now define % = max{i : (i,j) G P t for some t G Y v }, the previous inequality leads to 



v kl — 



Since, for every triple (t, F^(a) is bounded above and below, so are (a) and Af'/l 
Therefore, the previous inequality implies 



1 



k/i 



(t,s) 
,,j,k,l 



a 



(t). 



(t,s) 



a . 



"8 - o(i) 



(86) 



which holds for any s G Y v , (k, I) G P s , i.e., for any triple (s, k, I) lying in the dominant set. 



Recalling (1831) . for general vfj we have 



for all t G I v , G S t . 



(87) 



Next observe that (f8~7l) may be used to upper bound cu = e A . In fact, it may be seen that 
applying (l80l) for any triple (t,i,j) and taking into account (1871) leads to 



(t) 



i.e., 



Bf]8 t Ci/* (az{v)f\y/1> 

T_ 

u ~ 0(a * ) . 



o{or^) 



(88) 



Next, ([871) implies that 



(*) 

I/- ■ 



E E ir "S+E E <p«-rK 

teA, (i,j)es- tei v (i,j)es~ 

T 1 



te/„ (i,j):Tij>T 



(89) 



where in the second line we have used (l76l) . and in the final line we have used (f8~7T) and also 
recalled the definition (1201) of the parameter 

Next note that, recalling (l22l) and (l23l) . equations (f8~TI) and (1821) may be written as 



and 



Qi (y) + h(a) 



Q2 (2/)+f 2 (a) 



(90) 



(91) 
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respectively, where we define 



(92) 



and 



««)=Er E iB S c '" 

te/„ Vi (i,j):Tij>T 

Note that since 2(1/) is bounded below as a — > 0, from (|88l ) we have fi(a) ~ O(a^) and 
f 2 (a) ~ 0(a*). Recalling that u = e~ x , from (|92| ) we obtain the following expression for the 
Lagrange multiplier A: 

T T T 
A = - + -loga + -log z(u) -logy . (93) 

ip ip yi 

From (|90b and ( |9T| ) this latter expression may be written as 

A = T - + ^ log a - - log [Q 2 {Q- X \\ - h(a))) + f a (a)] - log [Qr X (l - ■ 
Using the Taylor series of + x ) aroun d x = 0, we have 



<xz(z/) / A I u 



az(v) / A I u; 



gr 1 (i-f 1 («)) = gr 1 (i) + gi 



(94) 



where 



gi(o) 



a) + 0(f 1 2 (a)). 



Note that since fi(a) ~ 0{a*), we also have gi(a) ~ O(a^). Substituting the obtained 
expression (l94l) for — fi(«)) into the previous expression (|931) for A we obtain 



A = I + ^ log a - ^ log [©2(0^(1) + gi(«))) + - log [Q^l) + gi' 



We may now develop Q2{Q\ + gi(«))) using the Taylor series for Q 2 {Qi + a;) around 
x = and log [Qj" 1 (l) + gi(a)] using the Taylor series for log(Qj~ 1 (l) + x) around x = 0. We 



obtain 



q 2 (qt 1 (i) + gi(a))) = g 2 (Qr 1 (i)) + g 2 («) 



where 



g 2 («) = ^(gr 1 (i))-gi(«) + o( g ?(a)) 
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and 

log [Q^(l) + gl (t*)] = logg^(l) + hx(a) 

where 

h 1 (a) = ^= T 7 i yg 1 (a) + 0(g?(a)). 

Again, note that since gi(a) ~ O(a^), we have g 2 (a) ~ O(a^) and hi (a) ~ O(a^). Therefore, 
we have 

a = ^ + ^ log a - ^ log [g 2 (gr x (i)) + &(<*) + f 2 («)] + io g — ^— - hi(«) . 

ip Qi (l) 

Finally, we develop log [g 2 (Qr 1 ( 1 )) + 82(a) + f 2 (a)] using the Taylor series for log [Q 2 (Qi 1 (l)) + x] 
around x — 0. We obtain 

log [Q 2 (Q^(1)) + g 2 (a) + f 2 (a)] = \ogQ 2 {Q^{\)) + h 2 (a) 

where 

^ = 7777FT7TTv(g 2 («) + W) + 0((&(a) + f 2 («)) 2 ) . 

Using this expression we obtain 

T T T 1 IT 

A = - + - loga + - log + log — y— - - -h 2 (a) - hi (a) . 

V> v v g 2 (g i 1 (i)) Qi (i) i> 

Since g 2 (a) ~ O(a^) and f 2 (a) ~ O(a^), we have h 2 (a) ~ O(a^); hence, we obtain 

T T T 1 1 

A = - + -loga+ -log— — + log— 17- + 0(a$) . (95) 
^ ^ ^ QiiQi (l)) <5i (1) 

Finally, substituting ([89]) and d5U) into d79j and recalling the O ( a min{f/r ' 2} ) term completes 
the proof of the theorem. 

VI. Conclusion 

A compact expression for the asymptotic growth rate of the weight distribution of irregular 
D-GLDPC codes for small linear-weight codewords has been derived. Ensembles with check or 
variable node minimum distance greater than 2 are shown to have good growth rate behavior, 
while for other ensembles an important parameter is identified which discriminates between 
good and bad growth rate behavior of the ensemble. This generalizes known results for LDPC 
codes and GLDPC codes, and also generalizes the corresponding connection with the stability 
condition over the BEC. 
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Appendix A 
Proof of Lemma [572] 

Consider any which satisfies the optimization constraints (|45l)-(|48l). From constraint (|45l) , 
S small implies that e t is small for every t G J c . From constraint (|47T) we conclude that flf' is 
small for every t G J c , z G C/ t ~, and so /?q*' ) is close to 1 for all t G J c . Formally, for any i G 7 C 
the term in the sum over i G U t in (144)) corresponding to i = may be written as (here we use 
(l46l) . and the Taylor series of log (1 — x) around x = 0) 

1 



0<'> log 



.A 



(*) 



(EA" , -i)'og(i-Eft ( ' ) ) 

(Ea , "->)(-Ea'"+°((Ea i 



Therefore we have 



*) = o((EA ( 



i.e. 



(*)V 



(96) 



for some k t > independent of {[if } ig[7 -. It follows that 



£^(/3 (t) ) 






; e a" 1 ) 2 


(97) 




teic 




ieur 





where = k t jt for each t G I c . Also, by (l47l) we have ^2 ieU - /3® < (-tilt and therefore 



E<Ea ( ")^E(|) j 



(98) 



*e/ c jet/- te/ c 
Denote 7 = max te / c {7j/7 t 2 }; then, combining (|9~71 ) and (l98l) . 



tela \teic / 



7 5 2 



and thus the expression Yltei i t Ft{(3^) is 0{5 2 ), as desired. 
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Appendix B 

Growth Rate of the Stopping Set Size Distribution 

This appendix illustrates how the approach developed to analyze the growth rate of the weight 
distribution of D-GLDPC codes can also be used to analyze the growth rate of the stopping set 
size distribution. The concept of stopping set was introduced in ll26ll within the context of iterative 
decoding of LDPC codes over the BEC. A stopping set of an LDPC code is defined as any subset 
of the VNs such that if a CN is connected to it, it is connected to it at least twice. Over the 
BEC, stopping sets under iterative decoding play the same role as codewords under maximum 
likelihood decoding. 

Stopping sets can also be defined within the context of D-GLDPC codes over the BEC. In 
contrast to the case of LDPC codes however, here the definition of stopping set is not unique as 
it depends on the decoding algorithm used at the VNs and CNs to locally recover from erasures. 
In the following, we assume maximum a posteriori (MAP) erasure decoding is used at both the 
VNs and the CNs. 

Consider an (n, k) linear block code and a generator matrix G for this code. Moreover, 
consider a k-bii information word u containing erasures. Encoding of this information word 
produces an n-bit word x containing erasures, where the non-erasure encoded bits are those 
depending only on the non-erasure information bits (through the relationship x = uG). We say 
that the erasure pattern on the encoded bits is induced by the erasure pattern on the information 
bits. 

Consider an (s, h) CN of a D-GLDPC code over the BEC, and let G denote a generator 
matrix for this CN. A local stopping set for this CN is a subset of the local code bits such that, 
if all of these bits are erased, MAP decoding cannot recover any of these bits. This occurs if and 
only if each column of G corresponding to erased bits is linearly independent of the columns of 
G corresponding to the non-erased bitsj. The size of the local stopping set for the CN is equal 
to the number of erased local code bits. 

Next consider a (q, k) VN of a D-GLDPC code over the BEC, and let G denote the generator 
matrix for this VN, i.e. G expresses the relationship between the local information word and 
the local codeword whose bits are associated with the Tanner graph edges. A local stopping set 

6 Note that the definition of local stopping set is independent of the particular generator matrix chosen for the CN. 
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for this VN is a subset of the local information bits, together with a subset of the local code 
bits, such that if all of these bits are erased, MAP decoding at the VN cannot recover any of 
these bits. Since MAP decoding over the BEC consists of running Gaussian elimination on the 
(k x (q + k)) matrix G' = [G | where is the (k x k) identity matrix, this occurs if and 
only if each column of G' corresponding to an erased bit is linearly independent of the columns 
of G' corresponding to the non-erased bits. The split size of a VN stopping set is equal to (u, v) 
where there are u erasures among the local information bits, and v erasures among the local 
code bits. 

The concepts of assignment and split assignment remain valid also within the present context 
of stopping sets. However we redefine the concepts of check-valid assignment, check-valid split 
assignment and variable-valid split assignment as follows. 

Definition B.l: An assignment is said to be check-valid if the following condition holds: 
supposing that each edge of the assignment carries an erasure and each of the other edges carries 
a non-erasure, each CN recognizes a local stopping set. 

Definition B.2: A split assignment is said to be check-valid if its assignment is check-valid. 
A split assignment is said to be variable-valid if the following holds. Supposing that each edge of 
its assignment carries an erasure and each of the other edges carries a non-erasure, and supposing 
that each D-GLDPC code bit in the codeword assignment is set to an erasure and each of the 
other code bits to a non-erasure, each VN recognizes a local stopping set, where the erasure 
pattern on its local code bits is that induced by the erasure pattern on its local information bits. 

A stopping set of a D-GLDPC code may be defined as a codeword assignment such that the 
split assignment formed by the codeword assignment and the corresponding induced assignment 
is both variable-valid and check-valid. The size of such a stopping set is equal to the number 
of elements in the codeword assignment. The growth rate of the stopping set size distribution is 
defined as in ([8]) , where in this context N an denotes the number of stopping sets of size an. 
We define the stopping set size enumerating polynomial for CN type t E I c by 

u=rt 

where ifu denotes the number of local stopping sets of size u for CNs of type t. Letting G t 
denote any generator matrix for CN type t, note that tfu is equal to the number of ways of 
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choosing u columns of G t such that each of the selected columns is linearly independent of the 
s t — u non-selected columns. 

We also define the bivariate stopping set split size enumerating polynomial for VN type t £ I v 

by 

u=l v=p t 

where denotes the number of local stopping sets of split size (u, v) for VNs of type t. 
Letting G t denote the generator matrix for VN type t, note that is the number of ways 
of choosing u columns of 1^ and v columns of G t such that each of the selected columns is 
linearly independent of the k t + q t — v — u non-selected columns of G' t = [G t \ 1^]. 

With these definitions in place, the analog of Theorem 14.11 may be developed in an identical 
manner for the growth rate of the stopping set size distribution. The proof is identical to that 
developed in Section |V] for the growth rate of the weight distribution, with the stopping set size 
enumerators in place of the weight enumerators, the new definitions of check- and variable- 
validity, and "stopping set size" in place of "codeword weight". In particular, we have the 
following result for the case r = p = 2, which is the most important case in practice. 

Theorem B.l: Consider a D-GLPDC code ensemble M. n satisfying r = p = 2. Assume the 
code is transmitted over the BEC, and it is decoded via iterative decoding with MAP erasure 
decoding at the VNs and CNs. The growth rate of the stopping set size distribution is given by 



G(a) = a log 



+ 0(a 3/2 ) (99) 



_P- 1 (1/C). 

where the polynomial P(x) and the parameter C are given by (|24|) and (1251) respectively. 

The proof of this result is completed by observing that, if r = p = 2, we have (p® = A% for 
each t £ X c and i?^ 2 = B® 2 for each t £ X v , u — 1, 2, • • • , k t . Note that, in contrast to (1341) . the 
error term is always 0(« 3 / 2 ) in (l99l because in the context of stopping sets, p = r = 2 implies 

p = f = 3. 

Finally, note that (l99l generalizes to D-GLDPC codes the expression G(a) = a log(A'(0)//(l)) + 
o(a) obtained in ll27l for the growth rate of the stopping set size distribution of LDPC codes. 
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